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Abstract The Conway-Maxwell-Poisson (CMP) distribution is a natural two-parameter 
generalisation of the Poisson distribution which has received some attention in the statis¬ 
tics literature in recent years by offering flexible generalisations of some well-known mod¬ 
els. In this work, we begin by establishing some properties of both the CMP distribution 
and an analogous generalisation of the binomial distribution, which we refer to as the 
CMB distribution. We also consider some convergence results and approximations, in¬ 
cluding a bound on the total variation distance between a CMB distribution and the 
corresponding CMP limit. 
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1 Introduction 


A two-parameter generalisation of the Poisson distribution was introduced by Conway 
and Maxwell 1^ as the stationary number of occupants of a queuing system with state 
dependent service or arrival rates. This distribution has since become known as the 
Conway-Maxwell-Poisson (CMP) distribution. Beginning with the work of Boatwright, 
Borle and Kadane and Shmueli et ah M], the CMP distribution has received recent 
attention in the statistics literature on account of the flexibility it offers in statistical 
models. For example, the CMP distribution can model data which is either under- or 
over-dispersed relative to the Poisson distribution. This property is exploited by Sellers 
and Shmueli [^, who use the CMP distribution to generalise the Poisson and logistic 
regression models. Kadane et ah 2^ considered the use of the CMP distribution in 


Bayesian analysis, and Wu, Holan and Wilkie (S^ use the CMP distribution as part of a 
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Bayesian model for spatio-temporal data. The CMP distribution is employed in a flexible 
cure rate model formulated by Rodrigues et al. 28| and further analysed by Balakrishnan 
and Pal [^. 

Our purpose in this work is twofold. Motivated by the use of the CMP distribution 
in the statistical literature, we hrstly aim (in Section [2]) to derive explicit distributional 
properties of the CMP distribution and an analogous generalisation of the binomial dis¬ 
tribution, the CMB distribution. Our second aim is to consider the CMP distribution as 
a limiting distribution. We give conditions under which sums of dependent Bernoulli ran¬ 
dom variables will converge in distribution to a CMP random variable, and give an explicit 
bound in total variation distance between the CMB distribution and the corresponding 
CMP limit. These convergence results are detailed in Sections |3] and IH 

We use the remainder of this section to introduce the CMP and CMB distributions 
and collect some straightforward properties which will prove useful in the sequel. We also 
introduce some further dehnitions that we will need in the work that follows. 


1.1 The CMP distribution 

The CMP distribution is a natural two-parameter generalisation of the Poisson distribu¬ 
tion. We will write X ~ CMP(A, u) if 


P(W = j) = 


A^' 


Z{X,iy) {j\r^ 

where Z{X, v) is a normalizing constant dehned by 


jez+ = {0,i,2,...}. 


( 1 . 1 ) 


^(A, .)=$:- 


A* 


^ {i\y 

i=0 ^ ’ 


The domain of admissible parameters for which (11.11) dehnes a probability distribution is 
A, > 0, and 0 < A < 1, = 0. 

The introduction of the second parameter v allows for either sub- or super-linear 
growth of the ratio P(W = j — 1)/P(W = j), and allows X to have variance either less 
than or greater than its mean. Of course, the mean of X ~ CMP (A, u) is not, in general, 
A. In Section [2] we will consider further distributional properties of the CMP distribution, 
including expressions for its moments. 

Clearly, in the case where u = 1, X ^ CMP(A, 1) has the Poisson distribution Po(A) 
and the normalizing constant Z(X, 1) = e^. As noted by Shmueli et al. [^, other choices 
of u also give rise to well-known distributions. For example, in the case where z/ = 0 and 
0 < A < 1, W has a geometric distribution, with Z(A, 0) = (1 — A)“^. In the limit z/ ^ oo, 
X converges in distribution to a Bernoulli random variable with mean A(1 -|- A)“^ and 
\im^^ooZ{X,u) = 1 -F A. 

In general, of course, the normalizing constant Z(A, z/) does not permit such a neat, 
closed-form expression. Asymptotic results are available, however. Gillispie and Green 


17l | prove that, for hxed u, 


Z{X,iy) 


__ (1+0 (A-'/-)) 


( 1 . 2 ) 
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as A —)■ oo, confirming a conjecture made by Shmueli et al. j^. This asymptotic result 
may also be used to obtain asymptotic results for the probability generating function of 
X ~ CMP(A, z/), since it may be easily seen that 


. ^ Z{sX,u) 
Z{X,u) • 


(1.3) 


1.2 The CMB distribution 


Just as the CMP distribution arises naturally as a generalisation of the Poisson dis¬ 
tribution, we may define an analogous generalisation of the binomial distribution. We 
refer to this as the Conway-Maxwell-binomial (CMB) distribution and write that Y ~ 
CMB(n,p, u) if 




where n G N = {1, 2,...}, 0 < p < 1 and z/ > 0. The normalizing constant is dehned 


by 



The dependence of on p and z/ is suppressed for notational convenience. Of course, the 
case z/ = 1 is the usual binomial distribution Y ~ Bin(n,p), with normalizing constant 

= 1. Shmueli et al. considered the CMB distribution and derived some of its 
basic properties, referring to it as the CMP-binomial distribution. We, however, consider 
it more natural to refer to this as the CMB distribution (a similar convention is also 
followed by Kadane j^); we shall also later refer to an analogous generalisation of the 
Poisson binomial distribution as the CMPB distribution. 

There is a simple relationship between CMP and CMB random variables, which 
generalises a well-known result concerning Poisson and binomial random variables. If 
Xi ~ CMP(Ai,z/) and X 2 ~ CMP(A 2 ,z/) are independent, then Xi\Xi + X 2 = n ^ 
CMB(n, Ai/(Ai-|-A 2 ), z/) ee 0 ). 



It was also noted by that Y ~ CMB(n,p, v) may be written as a sum of exchange¬ 
able Bernoulli random variables Zi,..., satisfying 



(1.4) 


where k = zi + ■ ■ ■ + Zn- Note that EZi ^ p in general, unless z^ = 1. However, 
EZi = n~^WY may be either calculated explicitly or estimated using some of the properties 
of the CMB distribution to be discussed in the sequel. 

From the mass functions given above, it can be seen that if H ~ CMB(n, A/n*^, z^), 
then Y converges in distribution to W ~ CMP(A, z/) as n —?■ 00. We return to this 
convergence in Section [3] below, where we give an explicit bound on the total variation 
distance between these distributions. 
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1.3 Power-biasing 


In what follows, we will need the definition of power-biasing, as nsed by Pekoz, Rollin 


and Ross [27|]. For any non-negative random variable W with hnite z/-th moment, we say 


that has the z/-power-biased distribution of W if 


(Eiy'^)E/(IF(^)) =E[W’'f{W)] , 


(1.5) 


for all / ; M’*' M such that the expectations exist. In this paper, we will be interested 
in the case that W is non-negative and integer-valued. In this case, the mass function of 
is given by 

P(IRF) = j) = = j) j G Z+ . 

^ ^ EW^' 

Properties of a large family of such transformations, of which power-biasing is a part, are 
discussed by Goldstein and Reinert [l^. The case z/ = 1 is the usual size-biasing, which 
has often previously been employed in conjunction with the Poisson distribution: see 
Barbour, Holst and Janson j^, Daly, Lefevre and Utev [l^, Daly and Johnson [l^, and 
references therein for some examples. The power-biasing we employ here is the natural 
generalisation of size-biasing that may be applied in the CMP case. 


2 Distributional properties of the CMP and CMB 
distributions 

In this section we collect some distributional properties of the CMP and CMB distribu¬ 
tions. Some will be required in the sequel when considering approximations and conver¬ 
gence to the CMP distribution, and all of are of some interest, either independently or 
for statistical applications. 

2.1 Moments, cumulants, and related results 

We begin this section by noting, in Proposition 12.11 below, that some moments of the 
CMP distribution may be easily and expli citly calculated. The simple formula EX'^ = A 
was already known to Sellers and Shmueli |29|. We also note the corresponding result for 
the CMB distribution. 

Here and in the sequel we let 

{j)r = j{j - 1) • • • (j - r + 1) 


denote the falling factorial. 

Proposition 2.1. (i). Let X ~ CMP{X,u), where A,z/ > 0. Then 

E[{{X\Y]=X\ 


for r G N. 
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(ii). Let Y ~ CMB{n,p,u), where z/ > 0. Then 

E|((y).)i = ^((>i),)v, 

for r = l,...,?7, — 1. 

Proof. We have 


E|((A'),)‘'] 


z(\,v: 

y 




k=0 

oo 


{k\y 






Z(A,-')£((*->■)!)- 


and 


E[((ywi = 7:rE(W 


” fc=0 ^ 


n! 


Cn\{n — r) \ J \ k — r 




n—k 




i=o 


" ’■V(i-pr-’-^ = %^((nwv 


J 


a 


□ 


Remark 2.2. It is well-known that the factorial moments of Z ~ Po(A) are given by 
E[(X)r] = A'’. We therefore have the attractive formula E[((X)r)'^] = E[(Z)r], for X ~ 
CMP(A,i/). 

Such simple expressions do not exist for moments of W ~ CMP(A,z/) which are not 
of the form E[((W)r)^]. Instead, we use fll.21) to give asymptotic expressions for such 
moments. 


Proposition 2.3. Let X ~ CMP{X, v). Then, for k E N, 

EX^ ~ A^/'^(l + 0(A-P'^)) , 

as X ^ oo. 


Proof. It is clear that, for k E N, 


Differentiating fll.2p (see Remark 12.41 for a justihcation) we have that 

0 ~ A‘/- . (1 + O (A-/-)) . 


( 2 . 1 ) 
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as A —)■ cxo, and hence 


E [( X ) fe ] ~ A'=/"(1 + 0 ( A -'/")) , 


as A oo. We now exploit the following connection between moments and factorial 
moments: 


r=l 


E|(A'), 


( 2 , 2 ) 


for fc G N, where the Stirling numbers of the second kind are given by = 

see Olver et ah 2^). Using fl2.2p . and noting that = 1, com- 

□ 


pletes the proof. 


Remark 2.4. In the above proof, we differentiated the asymptotic formula fll.2p in the 
naive sense by simply differentiating the leading term k times. We shall also do this below 
in deriving the variance formula (12.4p . and in Proposition 12.6[ in which we differentiate an 
asymptotic series for log(Z(Ae*, z/)) with respect to t in an analogous manner. However, 
as noted by Hinch j^, p. 23, asymptotic approximations cannot be differentiated in this 
manner in general. Fortunately, in the case of the asymptotic expansion (11.2p for Z{X, u) 
we can do so. This is because we have the following asymptotic formula for the CMP 
normalising constant that is more precise than (II.2p . For hxed u, 


Z(A, ly) ~ 


exp{z/Ai/^} 

A(-i)/2-(27r)U-P/2yF 


1 + OfcA 


k=l 



(2.3) 


as A —)■ oo, where the are constants that do not involve A. The m-th derivative of the 
asymptotic series (12. 3 p is dominated by the m-th derivative of the leading term of (12.31) . 
meaning that one can naively differentiate the asymptotic series, as we did in the proof 
of Proposition 12.31 

The leading term in the asymptotic expansion (12. 3 p was obtained for integer ly by 
Shmueli et ah [UtLand then for all > 0 by Gillispie and Green 0, When stating their 
results, 3l| and |l7| did not include the lower order term but it can be 

easily read off from their analysis. For integer z/, gave an integral representation for 


Z(A, ly) and then applied Laplace’s approximation to write down the leading order term in 
its asymptotic expansion. Laplace’s approximation gives that (see Shun and McCullagh 
j^, p. 750), for inhnitely differentiable g : ^ W, 


exp{—n( 7 (a;)} dx 


n det{g'' 

2^ 


' 1/2 


exp{-ng}il + ^bkn ^ 


Zc=l 


as n —)■ cxo , 


where the bk do not involve n, and g and g" denote g and the matrix of second order 
derivatives of g, respectively, evaluated at the value x that minimises g. It is now clear 
that the lower order term in (j2.3p has the form YlT-i Fon general z^ > 0, 0 

obtained an expression for the leading term in the asymptotic expansion of Z{A, v) by us¬ 
ing Laplace’s approximation, as well as several other simpler asymptotic approximations. 
In each of these approximations, the lower order term is of the form czcA“^/^, from 

which it follows that Z{A, ly) has an asymptotic expansion of the form (12.3p . 
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We also have the following relationship between moments of X ~ CMP (A, z/): 


= a4-EX" + EXEX" , 
a A 


for r > 0. See eqnation (6) of Shmneli et ah Ml- With r = 1 we obtain 


Var(X) = A^EX ~ -A^/" + 0(1), 
a A z/ 

as A —)■ cxo, from Proposition 12.31 This also gives the following corollary. 
Corollary 2.5. Let m be the median of X ^ CMP{X,n). Then 

m ~ A*'" + O (A’-'^-) , 


(2.4) 


as A —)■ cxo. 

Proof. From above, EX ~ + 0(1) and a = ■\/Var(X) ~ + 0(1). Since 

cr < oo, we may use a result of Mallows j^, who showed that |EX — m\ < a. The result 
follows. □ 


As with the moments above, we may also hnd asymptotic expressions for the cumulants 
of the CMP distribution. 


Proposition 2.6. Porn > 1, let Kn be the nth cumulant of X CMP{X,u). Then 


Kr, 


,71—1 


A'/"+ 0(1), 


as X ^ oo. 


Proof. From fll.3l) . the cumulant generating function of X ~ CMP (A, z/) is 


g{t) = log(E[e*^]) = log(Z(Ae*, u)) - log(Z(A, z/)). 


The cumulants are given by 

Kn = ^^"^0) = ^ log(Z(Ae*, z/)) 


From (II.2p . 

log(Z(Ae*,z/)) ~ z/A^/V/", 

as A —?■ cxD. The expression for the leading term in the asymptotic expansion of Kn now 
easily follows, and a straightforward analysis, which is omitted, shows that the second 
term is 0(1) for all n > 1. The result now follows. □ 
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Note that as a corollary to this result, the skewness 71 of X ~ CMP(A, u) satishes 


1-1 = ^ ~ + O (A-*/^-) , 


as A ^ 00 , where = Var(X) ~ from fl2.4p . Similarly, the excess kurtosis 72 of X 

satishes 

K4 1 

7a 


X-^/'^ + O (A-2/^) , 

0 ” z/ ^ 

as A —)■ 00 . For comparison, recall that in the Poisson case (z^ = 1), 71 = A“^/^ and 

72 = A"h 

We conclude this section with two further results of a similar havour. We begin by 
giving expressions for the modes of the CMP and CMB distributions. The expression for 
the mode of CMP distribution for non-integral A^/^ was known to Guikema and Goffelt 


20| . but for clarity and completeness we state the result and give the simple proof. The 


expression for the mode of the CMB distribution is new. Here and in the sequel we will 
let [-J denote the hoor function. 

Proposition 2.7. (i). Let X ~ CMP{X,i'). Then the mode of X is if X^^^ is not 

an integer. Otherwise, the modes of X are X^^'^ and A^/^ — 1. 


(a). LetY ~ CMB{n,p,p) and define 


a = 


n + 1 


1 + 

' p 


llu 


Then the mode of Y is [aj if a is not an integer. Otherwise, the modes of Y are a 
and a — 1. 


Proof, (i). Writing 


F{X=j) = 


Z{X,iy) 


(AV 




J'- 


the result now follows as in the Poisson case, for which the result is well-known. 

(ii). This is a straightforward generalisation of the derivation of the mode of a binomial 
distribution given by Kaas and Buhrman 22(|. Consideration of the ratio 


P(y = k + 1 ) f n — k 


p(y = k) 


k + 1 


P 


n — k 


P 


k + 1 \1 — p 


P 




shows that P(y = k) increases as a function of k ii k < a and decreases for k > a — 1 . 
Therefore, if a is not an integer, P(y = k) increases for k < [aJ and decreases for 
k > [aJ, giving [aJ as the mode. If a is an integer then P(y = k) increases for 
k < a — 1 and decreases for k > a and so a — 1 and a are neighbouring modes. 

□ 
















We also give, in Proposition 12.81 below, an expression for the mean deviation of X^, 
where X ~ CMP(A, u), as usual. This generalises a result of Crow [ll|, who showed that 
if Z ~ Po(A) has a Poisson distribution, then 

aPJ+^ 

E|Z-A|=2e-"-p^ = 2AS, 

where S = is the maximum value of the mass function of Z. 

Proposition 2.8. Let X ~ CMP{\^u). Then 

E|X- - A| = 2Z(A, = 2AT , 

where T = Z{X, u)~^ maximum value of the mass function of X. 

Proof. 

\k 

"‘(W 

- yk °° \ A: 

= z(A,o-‘ E 


E|X^-A| = A|Z(A,i/)- 


k=0 


k=\\^/''\+l 


{k\y 


= 2Z{\,u 


xLAV-J+i 

_ 

[AV-J! • 


□ 


2.2 Characterisations 


In Section 3, we will use Stein’s method for probability approximations (see, for example, 
Stein or Chen 0 ) to give bounds on the convergence of the CMB distribution to a 


suitable CMP limit. Stein’s method relies on linear operators characterising distributions 
of interest. In the following lemma, we present such characterisations for the CMP and 
CMB distributions. These will also prove useful in deriving several other properties of 
these distributions in the work that follows. 


Lemma 2.9. We have the following characterisations for the CMP and CMB distribu¬ 
tions. 

(i). Let X ~ CMP(\,u), and suppose that f : IC M zs such that E|f(X + 1)1 < cx) 
and E|X^/(X)I < oo. Then 

E[A/(X + 1)-XV(X)] = 0. (2.5) 

Conversely, suppose now that W is a real-valued random variable supported on IX 
such that E[Xf{W + 1) - W'fiW)] = 0 for all bounded / : Z+ ^ M. Then W ~ 
CMP{X,u). 
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(ii). Let Y ~ CMB{n,p, v), and suppose that / : Z+ i—)■ M zs such that E|/(y + 1)| < cxo 
and¥.\Y'^f{Y)\<oo. Then 

E[p(n - YYfiY + 1) - (1 - p)Y'^f{Y)] = 0. (2.6) 

Proof. The characterising equations fl2.5p and fl2.6p may be obtained directly through 
straightforward manipulations, or from the work of Brown and Xia jsj , for example, who 
consider such characterisations in the more general setting of the equilibrium distribution 
of a birth-death process. To prove the converse statement for the CMP distribution in 
part (i), we consider the so-called Stein equation for the CMP distribution: 

I{x eA)- P(X e A) = XfA{x -Fl) - x-^fAix), (2.7) 

where X ~ CMP (A, n), A Y 7A and Ja '■ TA i—>■ R. In Lemma (3.71 below, we will obtain 
the unique solution to fl2.7p and prove that it is bounded. Since is bounded, evaluating 
both sides of fl2.7p at W and taking expectations gives that, for any A C Z+, 

¥{w e 71) - p(x: e 71) = E[A/A(hP +1) - W'^fAiW)] = o, 

from which it follows that W ~ CMP (A, v). □ 

We do not give a converse statement, and thus a complete characterisation, for the 
CMB distribution. We do not need the converse in this paper, and giving a proof analogous 
given to that for CMP distribution would be tedious, because we would need to solve the 
corresponding CMB Stein equation and then derive bounds for the solution. 


2.3 Stochastic ordering and related results 


In this section we will explore some properties of CMP and CMB distributions that may be 
obtained by considering various stochastic orderings. In particular, we will use the usual 
stochastic order and the convex order. For random variables U and V, we say that U is 
smaller than V in the usual stochastic order (which we denote [/ <st V) if E/(17) < E/(17) 
for all increasing functions /. Equivalently, U <st V if F{U > t) < P(17 > t) for all t. 
For random variables U and V with Ef/ = E17, we will say that U is smaller than V in 
the convex order (written U <cx V) if E/(17) < E/(17) for all convex functions /. Many 
further details on these orderings may be found in the book by Shaked and Shanthikumar 
30l |. for example. 

We begin with two lemmas that make use of the power biasing introduced in Section 

0 


Lemma 2.10. Let W be a non-negative random variable and 0 < a < /3. Suppose that 
EVF" and EhF^ exist. Then <st W^h) ^ 

Proof. It is easily checked, using the dehnition fll.Sp . that for a,(5 > 0, =st 

WA+^\ Taking 5 = (3 — a > 0, it therefore suffices to prove the lemma with a = 0. That 
is, we need to show that E/(hF) < E/(hF^^i) for all /9 > 0 and all increasing functions /. 
From the dehnition fll.Sp this is immediate, since Cov(hF^,/(IF)) > 0 for all increasing 
/■ □ 
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Lemma 2.11. Let Y ~ CMB{n,p,i'). Then <st L' + 1. 
Proof. Note that 


¥{Y + l=j) 


1 p{n -J + ly f n y ■ 

aEF" l-p 


j = 1,... ,n + 1; 


Hence, the condition that < E/(y + 1) (for all increasing / : Z+ M) is 

equivalent to the non-negativity of 


E 


/(r + i) 


f p(n-Yr y 

\ {l-p)EY‘- jJ ’ 


( 2 . 8 ) 


p{n—YY 


for all / increasing. Noting that, by Lemma 12.91 (ii), E ± — 
covariance of two increasing functions, and is hence non-negative. 


= 0 , 


is the 

□ 


2.3.1 Ordering results for CMP distributions 

Throughout this section, let X CMP(A, z/). It is clear that =st X + 1 and hence, 
for u > 1, Lemma [2.101 gives X^) X -|- 1. This is the negative dependence condition 
employed by Daly, Lefevre and Utev [l^. Some consequences of this stochastic ordering 
are given in Proposition ^. 12l below. Before we can state these, we define the total variation 
distance between non-negative, integer-valued random variables U and V: 


dTvmU),C{V)) = sup |P(f/eH)-P(I/eH)|. 

A<ZZ+ 

We will also need to define the Poincare (inverse spectral gap) constant Ru for a non¬ 
negative, integer-valued random variable U: 


Ru = sup 


E[g{Uf 


g^giU) [E[{g{U+l)-g{U)f] 
where the supremum is take over the set 


g{U) = {c/ : Z+ ^ M with E[g{Uf] < oo and Eg{U) = O} . 
Propositiou 2.12. Let X ~ CMP{X, u) with u > 1. Let p = EX. Then 

(i). 

dTv{C{X), Po{p)) < - (/i - Var{X)) ~ "Yzl + Q , 

/i u ^ ' 

as X ^ oo. 


(ii). 


Var{X) < Rx < p ■ 
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(iii). X <cx Z, where Z ~ Po{fi). In particular, 

/ y. \ -(M+i) 

> /X + t) < e* f 1 + - j 
F{X < /i - t) < e"* (^1 - ^ , 


where the latter bound applies ift<p.. 

Proof. The upper bound in (i) follows from Proposition 3 of Daly, Lefevre and Utev 
The asymptotic behaviour of the upper bound is a consequence of our Proposition 12.31 and 
(12.4p . In (ii), the lower bound is standard and the upper bound is from Theorem 1.1 of 
Daly and Johnson 1^. (iii) follows from Theorem 2.2 and Corollary 2.8 of Daly jl^. □ 


On the other hand, if i/ < 1 we have that X + 1 <st X^'^K In that case. Proposition 3 


of Daly, Lefevre and Utev [15| gives the upper bound 


dTvmX), Po(/i)) < - (Var(X) - p) ~ + 0(A-"), 

/i u 

as A ^ oo, where p = EX and the asymptotics of the upper bound again follow from our 
Proposition 12.31 and fl2.4p . 

However, in the case u < 1 we cannot adapt the proof of Theorem 1.1 of Daly and 
Johnson [l^ to give an analogue of Proposition 12.121 (iii. By suitably modifying the proof 
of Theorem 2.2 of Daly [l^, we may note that Z <cx X in this case, where Z Po(a.). 
There is, however, no concentration inequality corresponding to that given in Proposition 

[2T2](iii). 


2.3.2 Ordering results for CMB distributions 


Now let Y ~ CMB(n,p, u). In the case u > 1 we may combine Lemmas 12.101 and 12.111 to 
see that <st U + 1. That is, the negative dependence condition holds. We thus have 
the following analogue of Proposition 12.121 which may be proved in the same way as that 
result. 


Proposition 2.13. Let Y ~ CMB{n,p, n) with u > 1. Let p = EX. Then 

(%). 

dMCiY), Po{p)) < - (p - VariY)) . 


(ii). 


VariY) <RY<fi. 


(iii). Y <cx X, where X ~ Po{p.). In particular, 

P(X > yu + t) < eM 1 + 




P(U < /i - t) < e-M 1 - 


-(m+0 


t—fL 




where the latter bound applies ift<fi. 
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There is no corresponding result in the case z/ < 1, since the stochastic ordering 
<st Y + 1 holds regardless of the sign of 1 — z/, and so we cannot use our previous 
lemmas to make a stochastic comparison between and Y + 1 when Y ~ CMB(rz,,p, u) 
with v <1. 


2.4 Example: v = 2 

For some illustration of our results, consider the case z/ = 2 and let X ~ CMP (A, 2). Let 
Ir{x) be the modihed Bessel function of the hrst kind dehned by 

OO ^ 

= E i!r(r + A+l) 



Note that, by dehnition, the normalizing constant Z(A,2) = Jo( 2\/A). Hence, for m G N, 

1 OO •« ^ 

/o(2\/A) 


From (12.2p we therefore have 


m ^ N 

- E: 

k=l ^ ^ 


m \ A"/Vfc(2x/A) 

WVx) 


In particular, the mean of X is given by 

e.y = AE(^ 

Io(2XX) 

Also, since = A, the variance is given by 


Var(X) = A 


h /i(27A)^ \ 

f U2^/\)^) 


Formulas for the cumulants, skewness and excess kurtosis of X can also be obtained, but 
their expressions are more complicated and are omitted. 

Note that the asymptotic formula Ir{x) ~ as x —)> oo (see Olver et ah [^) easily 

allows one to to verify flET|) in this case. Writing Var(X)_= E[X(X - 1)] + EX - (EX)^, 
the Turan inequality Ir{x)‘^ > Ir+i{x)Ir-i{x) (see Amos [l|) also allows direct verihcation 
that Var(X) < EX, which follows from the convex ordering in Proposition 12.121 The 
total variation bound in that same result may be expressed as 


dTv(E(X),Po(EX)) < yX 


/i(2^ 

Ioi2^/X) 


/2(2%/A) \ 
/i(2VA) J 
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3 Convergence and approximation for CMB distri¬ 
butions 


In this section we will use Stein’s method for probability approximation to derive an 
explicit bound on the convergence of V ~ CMB(n, A/n^, z/) to X ~ CMP(A, z/) as n —)> oo. 
This convergence is the analogue of the classical convergence of the binomial distribution 
to a Poisson limit, which corresponds to the case v = 1 here. 

Stein’s method was hrst developed by Stein 


33[ in the context of normal approxi- 

An 


mation. The same techniques were applied to Poisson approximation by Chen [91]. 
account of the method for Poisson approximation, together with a wealth of examples, 
is given by Barbour, Holst and Janson j^. Stein’s method has also found a large num¬ 
ber of applications beyond the classical normal and Poisson approximation settings. For 
an introduction to Stein’s method and discussion of its wide applicability, the reader is 
referred to Barbour and Chen and references therein. 

For future use, we dehne 




min |l, max |l -|- A, (1)^} (l — A ^ 

if z/ > 1 and A > 

(1 + i) + 

if z/ < 1 and A > 

1 

if z/ > 1 and A < 

(l-A^-^)"^ 

if z/ < 1 and A < 


We use much of the remainder of this section to prove the following. 

Theorem 3.1. Let Y ~ CMB{n, \/rY, u) for some 0 < A < rY, and let X ~ CMP{X, u). 
Then 


dTv(C(Y),C(X)) < X + (1 + Er) min {l, A"'}) 

+ ^ (EF + EF^) mill {1, A"'} , 

where = max{l, v} and gu{X) is as defined above. 

Remark 3.2. Taking A = n'^p in Theorem 13.11 gives the following bound for the total 
variation distance between the laws of T ~ CMB(rz,,p, z/) and X ~ CMP(n^p, z/): 


dTv{C{Y),C{X)) <n''p(^j^ ++ (1+ Ey)min{l, (n» ^}) 

-|- n'^~^pCi, (ET + ET^) min |l, . 

However, we prefer to work with the parameters given in Theorem 13.11 because in that 
theorem the CMP distribution (which we regard as the limit distribution) does not depend 
on n. 


Remark 3.3. For large n, the bound of Theorem 13.11 is of order n which is in 

agreement with the order of upper bound in the classical case z/ = 1 that is given by 
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Barbour, Holst and Janson [^. In fact, Barbour and Hall obtained a lower bound of 
the same order: 

— min |A, A^} n~^ < (iTy(Bin(? 7 ,, A/n), Po(A)) < min |A, A^} n~^. (3.1) 

32 

It would be desirable to obtain a corresponding lower bound for all v > 0 , although 
their method of proof does not generalise easily to z/ 7 ^ 1. We have, however, been able 
to get a good indication of the ‘true’ rate of convergence via a simple numerical study. 
We fixed A = 1 and considered a number of different values of v. For each i/, we used 
Mathematica to evaluate '■= dTv(CMB(n, z/), CMP(1, f)) for different values of 
n. The values of \og{dy^ri) were plotted against log(n) and the gradient of the line of best 
fit was used to estimate the exponent of n. The results of this study strongly suggest that 
the convergence is indeed of order general A, but see Remark 13.51 below 

for a choice of A that gives a faster rate). For example, in the case z/ = 1/2 the fitted 
gradient was —0.502, and the fitted gradient was —0.974 for z/ = 3/2. A direction for 
future research is to verify this assertion theoretically by obtaining a lower bound of this 
order. 

Remark 3.4. For general z/, we do not have closed-form formulas for the moments EH 
and EH^. However, EH^ EX^ for large n, and so we can use the asymptotic formula 
EX^ A^/^ to see that, for large A, the upper bound of Theorem 13.II is of order 

y2/u 

- 1 -. 

n n'^ 

For z/ = 1, this dependence on A is not as good as the 0(A) rate of (13. Ij) . 

Remark 3.5. In the special case A = EH'^, the rate improves to order n~^\ 

dTviOY), C(X)) < min {1, A} — (EH + EH^) . ( 3 . 2 ) 

n 

This bound can be easily read off from the proof of Theorem 13.11 

Remark 3.6. As we shall see, the proof of Theorem 13.11 relies on a stochastic ordering 
argument. Before arriving at this proof, we considered generalising the classical Stein’s 
method proofs of the Poisson approximation of the binomial distribution. These ap¬ 
proaches involve local couplings or size bias couplings (see Barbour, Holst and Janson 
p]). However, neither of these approaches generalise easily to the CMP approximation 
of the CMB distribution. The first step in generalising the classical proofs is to write 
Y ~ CMB(? 7 ,, X/rY, v) as a sum of Bernoulli random variables. However, these Bernoulli 
random variables are strongly dependent (see fll.41) ). and so local couplings are not ap¬ 
plicable. Also, the natural generalisation of the size-bias coupling approach involves the 
construction of the power-bias distribution of H, which we found resulted in intractable 
calculations. 

The starting point for applying Stein’s method is the characterisation of the CMP 
distribution given by Lemma [2.91 fib Using that, we have the representation 

dTv{ay),C-{X))= sup \XEfAiY + l)-E[Y^UiY)]\ 

ACZ+ 

= sup |AE/^(y + 1) - EWE/^(U("))| , (3.3) 

ACZ+ 
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where /^ : Z+ i—)■ M solves the Stein equation 


l{x e A) - P(X e A) = A/a(x + 1) - x"/a(x). (3.4) 

Hence, in proving Theorem 13.11 we hnd a bound on |AE/a(H + 1) — 
which holds uniformly in H C Z"*". In order to do this, we will need bounds on the 
functions solving 03.41) . These are given in Lemma [3.71 below, whose proof is deferred 
until Section [221 

Lemma 3.7. The unique solution of the CMP{X, u) Stein equation \3.Jifj is given by 
/yi(0) = 0 and, for j > 0, 


no +1) = E 6 ^) - ip(v 6 A)]. 

for H C Z+. The solution satisfies the bounds 

sup sup \fA{j)\ < 9uW , 

ACZ+ 

sup sup \fA{j + 1 ) -fA{j)\ < A~^ (l - Z(A,z/)"^) < min{l,A"^} 

ACZ+j&Z+ 


(3.5) 


(3.6) 

(3.7) 


where gu{X) is as defined above. 

Remark 3.8. The value of /a( 0) is in fact irrelevant, and we follow the usual convention 
and set it equal to zero (see Barbour, Holst and Janson [^, p. 6). Therefore, to be precise, 
the function /^(j), as given by 03.5p . is the unique solution of 03.4p for j > 1. 

For use in what follows, we dehne the forward difference operator A by A/(j) = 
f(j + 1 ) — /(j), and the supremum norm H-H by ||/|| = sup^ |/(j)| for all / : Z+ M. 
Now, we have that 

XEfA{Y + l)-EY^EfA{Y^''^) = EY"^ (E/a(H + 1) - E/^(y(^))) + (A - EH^) E/^(F + 1). 

Recall from Lemma [2. Ill that <st Y + 1. Hence, we may follow the methods of 
Daly, Lefevre and Utev [l5| and obtain 

\XEfA{Y + 1) - EY^EfA{Yn\ < (Ey")||A/^|| (l + EH - EH^")) + ||/^|| |A - EH'I 

= II A/^ll ((1 + Ey)EF" - EH'+i) + \\fA\\ \X - ERH , 

where we used 01.5p to note that EY’^'^^ = EY'^EY^’^f We may then combine the repre¬ 
sentation 03.3p with Lemma 1X71 to get 


drv(E(F), C{X)) < min {1, A"^} ((1 + EY)EY’^ - EW+i) + ^^(A) |A - EW 


(3.8) 


To complete the proof of Theorem 13.11 we use Lemmas l3.9l and l3.10l below. These make use 
of the characterisation of the CMB distribution. The idea of combining characterisations 
of two distributions when using Stein’s method has previously been employed by Goldstein 


and Reinert 19 and Dobler 16 
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Lemma 3.9. Let Y ~ CMB{n,X/n^,u). Then 


> X(l+EY {EY + Ey2) j , 


where Cy = max{l, v}. 

Proof. We use the characterisation of the CMB distribution given in Lemma 12.91 (ii) to 
note that 


Er^+^ = A f 1 - — ) E 

-1 


> A 1 - 




E 


(y + i)li- 

(r + i) fi- 


Y 

n 
CyY 
n 


> AE 


(r + i)li 


n 


Lemma 3.10. Let Y ~ CMB{n,X/n^,u). Then 

I f ^ vn^'EY 

|A -EW| < A f —-r + 


— A n{n'^ — A) 

Proof. Let p = A/n'^. Using Lemma |23] (h), 


□ 


A-EW 



ULiirf 

i-p j 


The result then follows by applying Taylor’s theorem to the function {1 — yY. □ 

Substituting the bounds of Lemmas 13.91 and 13.101 into (13.81) completes the proof of 
Theorem 13.11 


3.1 Remarks on Lemma 13.10 

We use this section to give some remarks related to Lemma [3.101 Firstly, note that the 
upper bound given in that lemma is of order 77 ,-which can in fact easily be seen 
to be the optimal rate. Using Lemma [3. Ill below, we show that a better bound is possible 
when A/n^ is small, although this improved bound will of course still be of the same order 
as the bound given in Lemma 13.101 

Lemma 3.11. Let Y ~ CMB{n,p,v) with n > 1. There exists p* G (0,1] such that for 

p < p* 

rp.yu \ <n’'p ifn>l, 

\ > n'^p if u < 1. 
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Proof. Let 


h{p) = EY’^ 




Elementary calculations show that h(0) = 0, h'(0) = n'^ and 

h"(0) = -2n^ {rP - (n - - 1) . 

Note that (since n > 1), h"(0) < 0 for i/ > 1 and h"(0) > 0 for z/ < 1. Using the continuity 
of h and Taylor’s theorem applied to h, the result follows. □ 


Consider now the case Y ~ CMB(n, A/n*^, z/) with z^ > 1. By Lemma 13.111 for n 
sufficiently large we have that A — EY'' > 0, and we may then follow the proof of Lemma 
13.101 to get the bound 


A _ < A 


vn^EY 
n {n^ — A) 



which improves upon Lemma 13.101 

A similar argument in the case u <\ gives that, for n sufficiently large, A — EY'^ < 0 
and 

A 2 

EY‘' - A <-- . 

rp — \ 


3.2 Proof of Lemma 13.7 


It is straightforward to verify that fl3.5p . denoted by solves the Stein equation fl3.4p . 

To establish uniqueness of the solution, we take j = 0 in fl3.4p . from which it follows that 
any function that solves the Stein equation fl3.4l) must satisfy ^^(l) = /pi( 1)- By 

iteration on XHaH + 1) - fhA{j) = X/aU + 1) - j^fAU) follows that hA{j) = JaU) 
for all j > 1, which conhrms the uniqueness of the solution. 

We now establish fl3.7p . By constructing X ~ CMP (A, z/) as the equilibrium distribu¬ 
tion of a birth-death process with birth rates aj = X and death rates fij = the hrst 
inequality of fl3.7p follows from Corollary 2.12 of Brown and Xia j8|. Since Z{X,u) > 1 
for all A and z/, it follows that A“^ (1 — Z{X, < A“^. Finally, 


X-\l-Z{X,u)-^) 


E oo 

j=0 ((j+l)!)- ^ ^ 

E OO \k — 

k=0 (k\)" 


This completes the proof of (13.7p . 

Remark 3.12. The upper bound A“^ (1 — Z{X, for the forward difference is attained 

by /{i}(2) -/{i}(l). 

It remains to establish (13.6p . We do this by considering separately four cases. Our 
strategy is to suitably generalise the proof of Lemma 1.1.1 of Barbour, Holst and Janson 
j^, which gives analogous bounds in the Poisson case = 1). 

Firstly, note that from fl3.7p and the choice /^(O) = 0, 


|/a( 1)| < min{l, A , 


(3.9) 


for each A C Z+. Given fl3.9p . we need only to show the stated bound on IJaU + 1)| for 
j > 1 in each of the four cases detailed below. 
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Case I: z/ > 1 and A > 1 

Note that by examining the proof of Lemma 1.1.1 of Barbour, Holst and Janson j^, it is 
clear that {JaU + 1)| < 5/4 for all j > 1 whenever u > 1. We can, however, do a little 
better. Barbour and Eagleson [^, Lemma 4, obtained the bound I/aU + 1)1 + 1 when 
u = 1. By examining their proof we see that the bound also holds for all z/ > 1. 

Now, let Um = {0,1,... ,m}. It is easily verihed that the solution Ja to the Stein 
equation fl3.4p is given by 

JaU + 1) = Z/)(CMP(A, Z/){H n Uj} - CMP(A, z/){H}CMP(A, p){Uy) 

= z/)(CMP(A, v){A n f/j}CMP(A, z^){17/} 

- CMP(A, v){A n t/;}CMP(A, iy){Uj}) , 

where CMP(A, z/){H} = P(X G A). Hence 

|/a(j + 1)| < A-^-'(j!)^Z(A,z.)CMP(A,z.){17,}CMP(A,z.){17/}, (3.10) 

with equality for A = Uj. 

Equation fl3.10p gives us two ways of bounding I/aU + 1)|- Firstly, note that 


r=0 


r \ - 1 

xj -x-f 


\fA{j + 1)1 < A-^-^(j!)^Z(A, z.)CMP(A, u){Uy = A'^ g A"^ 
and when < A, this may be bounded to give 

|U(, + l)|<A-‘t(A) 5^7 

Secondly, we also have 


|/x(j + l)|<A-'-‘(i!)“Z(A,OCMP(A,i/){q} = A-' A 

r=j+l 


r-j f J_ 

r! 


and when (j + 2)^" > A, this may be bounded to give 


|/a(j + 1)1 < t- 


E 


(a + i)‘'GsV(j + 2) 


A 


(j + 2)‘ 


a + 1)" (U + 2)- - A) 


(3.11) 


(3.12) 


(3.13) 


(3.14) 


Note that the bounds fl3.11l) - fl3.14l) hold for all values of u and A. We will also make use 
of these bounds in the other cases we consider below. 

Now, for f <X- we use fl3.12p to get that 


UaU + 1)1 < A'-'""-'. 

Similarly, when (j + 2y > A + we use (13.1 4p to get that 

IU(J + 1)I<(3/2)W''^-'-', 


(3.15) 


(3.16) 
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noting that (j + 2)^ > It remains only to treat the case \j'^ — A| < \l 2 v_ 

To that end, let A — < A, and use fl3.1ip to note that 

/ m j \ 

IIaU + 1)1 < a ^ I ^ Clr j , 

\r=0 r-=LBJ+l / 

for any B < j, where 

Note that la^l < 1 for each r G We choose 

B = [1 - (1 - A-^/2^)^/^] , 

so that Or+i/oj. < 1 — for all r > B. Hence, we have 

|/a(j + 1)| < A-i + 1 + = A'/"-' [1 - (1 - . (3.17) 

Note that, by Taylor’s theorem and since > 1, 

1 _ (1 _ a-i/2.^)1/- < ^ (1 - A-V2-)'/"-' a-1/2. _ 

Hence, 

ln(j + 1)1 < ("l + H (1 - A*/"-'. (3.18) 

Finally, we consider the case A < < A + From fl3.13p we have 


/ [CJ oo 

IIaU + 1)1 < a M ^ br + ^ hr 

yr=j+l r=\C\+l 

for C > j, where 



Analogously to before, we note that \br\ < 1 for each r, and we make the choice C = 
\^/i^ (l + so that hr+i/br < (1 + A“^/^'^) for r > C. We then get the bound 

IIaU + 1)1 < [(1 + A-^/2")^/" - 1] + . (3.19) 



Using Taylor’s theorem, 

(1 + A-^/2")^/" - 1 < , 

since z/ > 1, and so 

l/.4(j + l)l< + (3.20) 

Combining the bounds fl3.15l) . (13.161) . fl3.18p and fl3.20p we obtain the stated bound on 
||/yi|| in this case. 
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Remark 3.13. Recall fl3.10p . Taking j ^ and using Stirling’s formula and fll.2p . 
gives 

I/a(j + 1)| ~ (27r)('^-i)/2yi7’ 

for j PS and large A. Hence, a bound of order X^/‘^'^~^ is the best that we can expect 
for 11 /a 11 for large A. This order is achieved by Lemma f3.7l This remark also applies to 
Case II considered below. 

Case II: z/ < 1 and A > 1 

Here we use an analogous argument to that employed in Case 1. The bounds fl3.15l) and 
(I3.16P still apply; the only changes to our argument come for the cases where — A| < 

A1-1/2i^_ 

When A — X^~^/‘^'' < j'" < A, we again use (13.171) . Since z/ < 1 in this case, Taylor’s 
theorem gives 

1 - (1 - , 

from which it follows that 

in(j + i)i< (i + l) 

When X < < X + we use (I3.19p . noting that, since z/ < 1, 

(1 + - 1 < - (1 + y^-l/2. ^ 

V ^ ' 

giving 

UaU +1)1 < (i +1) (1 + . 

The stated bound follows. 

Case III: z^ > 1 and A < 1 

As before, we may use the proof of Lemma 4 of Barbour and Eagleson to obtain the 
bound |/A(j + 1)1 < 1 for all z/ > 1. 

Case IV: u < 1 and A < 1 

Here we again use fl3.13p . That bound gives us 

in(i + i)i<A-‘^^^(i^) Q (v-r<A-i:(A-T. 

Since A < 1 and z/ < 1, we have X^~^ < 1. Hence we get the bound 

IIaU + 1)1 < • 
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Remark 3.14. Consider the case 1 ^ = 0 and A < 1. We then have that our CMP random 
variable X has a geometric distribution, supported on Z"*", with parameter P(X = 0) = 
1 — A. Note that in this case, Lemma ISTI gives the bound W/aW < (1 — A)“h This was 
shown, in Remark 4.1 of Daly [l^, to be the correct dependence on A for such a bound. 


4 Other convergence and approximation results 

In this section we consider other convergence and approximation results related to CMP 
distributions. 

4.1 Sums of Bernoulli random variables 

In Section Owe have considered the convergence of the CMB distribution to an appropriate 
CMP limit. In this case we were able to derive an explicit bound on this convergence. 
Recalling fll.ip . we are able to write a CMB distribution as a sum of Bernoulli random 
variables (having a particular dependence structure), with each Bernoulli summand having 
the same marginal distribution. In this section we consider how we may generalise (ll.lh 
to a sum of Bernoulli random variables which are no longer exchangeable and yet give a 
CMP limiting distribution in an analogous way to the limit considered in Section [3l In 
this case, although we are able to prove convergence in distribution, we are unable to give 
an explicit bound on the convergence rate; further discussion is given in Remark 14.21 
Consider the following generalisation of fll.4p . Let Xi, ... be Bernoulli random 
variables with joint distribution given by 

P(Xi =Xi,...,Xn = Xn) = -^ Q j > 

where k = Xi + ■ ■ ■ + Xn and the normalizing constant C'^ is given by 

n ^ X u—l 

k=o ^ ' AeF^. ieA jeA‘^ 

where 

Fk = {A C {1,... ,n} : |4| = k} . 

We consider the convergence of the sum W = Xi + ■ —h Xn. It is easy to see that W has 
mass function 

W>{W = k) =PnAk]Pl,---,Pn) = 

for /c = 0,1,..., n. This distribution generalises the Poisson binomial distribution in a way 
analogous to the CMP and CMB generalisations of the Poisson and binomial distributions. 
We therefore say that a random variable with mass function fl4.ip follows the Conway- 
Maxwell-Poisson binomial (CMPB) distribution. Of course, the case i/ = 1 is the usual 
Poisson binomial distribution and the case pi = ■ ■ ■ = Pn = P reduces to the CMB(?7,,p, f) 
distribution. 
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Theorem 4.1. Let W = Xi -\— • + be as above, with mass function Pn,u{k] pi,, Pn) 
given by U.lD with Pi = ^ for i = 1,... ,n, where the Xi are positive constants that do 
not involve n. Then W converges in distribution to X CMP{X, v) as n ^ oo, where 

n 

A = lim - y Ai. 

71^00 Jl 


%=\ 


Proof. Firstly, note that in the case u = 1 the result is known. It is the classical con¬ 
vergence of a sum of independent Bernoulli random variables to a Poisson distribution. 
That immediately gives us the limit 

A^ 


AeFkieA jeA^^ ^ 

as n —)■ oo. As a consequence of fl4.2p . we have that 




(4.2) 


as n —)■ OO, since 


lim IT ( 1 — — ] = lim IT ( 1 — — ] • lim IT ( 1 — — ] = e 

n—^oo V Tl / V Ti / n^cxDV Tl / 

jeA<^ ^ ^ j=i ^ leA ^ ^ 

Now, in the present case we may write the mass function (14.11) in the form 

v-l 


A&Fh i&A 


A, 


(4.3) 


A; 


-1 


,-A 


n\ 


PnAk] 


Clearly 

n\ 


( i.\ 

n^oo [n — Kj 

and 



A 




n 

A&Fk i&A j&A 


= 1 




i=i 




leA 


A; 


-1 


Note that 


lim IT ( 1 — 

n—Vrvo \ 


IgA 


A; 




-1 


= 1 


and that the product 


n 

j=l V 

and the normalizing constant C'^ do not depend on k. Combining these observations with 
rs]), we have that 

n\k 

lim Pn,u{k-, XxjrP ,..., XnjrP) = 


{k\Y ’ 


where C does not depend on k. The result follows. 


□ 


23 








Remark 4.2. It would be desirable to extend Theorem 14.11 to include an explicit bound 
on the convergence rate, as was achieved in Theorem 13. II Such a bound could, in principle, 
be established by generalising the proof of that theorem. This approach would require one 
to obtain a Stein equation for the CMPB distribution, a generalisation of the stochastic 
ordering result of Lemma 12.111 to the CMPB distribution, and an appropriate extension 
of the moment estimates of Lemmas 13.91 and 13.101 This is a possible direction for future 
research. 


4.2 Mixed CMP distributions 


Finally, we also consider the case of a mixed CMP distribution. For a non-negative, real¬ 
valued random variable we say that W ~ CMP(,^, has a mixed CMP distribution 
if 


P(hF = j) = 




-E 






j e z+ 


We assume throughout that ^ is such that this expectation exists. 

Following the proof of Theorem l.C (for mixed Poisson approximation) in the book 
by Barbour, Holst and Janson [^, we use the characterisation in Lemma [2.91 (i), along 
with the bounds on the solution to the Stein equation given in Lemma 13.71 to obtain the 
following. 

Theorem 4.3. Let ^ be a non-negative random variable. Then 

dMCMPi^, u), CMP{\, u)) < (7.(A)E|e - A|, 
where gv{\) is as defined in Section\^ 
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